Turning Speech Into Scripts

نویسندگان

  • Manny Rayner
  • Beth Ann Hockey
  • Frankie James
چکیده

We describe an architecture for implementing spoken natural language dialogue interfaces to semi-autonomous systems, in which the central idea is to transform the input speech signal through successive levels of representation corresponding roughly to linguistic knowledge, dialogue knowledge, and domain knowledge. The final representation is an executable program in a simple scripting language equivalent to a subset of CSHELL. At each stage of the translation process, an input is transformed into an output, producing as a by-product a “meta-output” which describes the nature of the transformation performed. We show how consistent use of the output/meta-output distinction permits a simple and perspicuous treatment of apparently diverse topics including resolution of pronouns, correction of user misconceptions, and optimization of scripts. The methods described have been concretely realized in a prototype speech interface to a simulation of the Personal Satellite Assistant.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Script training treatment for adults with apraxia of speech.

PURPOSE Outcomes of script training for individuals with apraxia of speech (AOS) and mild anomic aphasia were investigated. Script training is a functional treatment that has been successful for individuals with aphasia but has not been applied to individuals with AOS. Principles of motor learning were incorporated into training to promote long-term retention of scripts. METHOD Three individu...

متن کامل

Application of Psychological Characteristics to D-Script Model for Emotional Speech Processing

D-scripts model is originally developed for description of affective (emotional) mass media texts and with extension also applies to emotional speech synthesis. In this model we distinguish units for “rational” inference (r-scripts) and units for “emotional” processing of meaning (d-scripts). Basing on a psycholinguistics study we demonstrate relations between classes of emotional utterances in...

متن کامل

A Contrastive Analysis of Persian and English Compliment, Request, and Invitation Patterns within the Semantic Metalanguage Framework

Speech acts shape the core of pragmatic competence and their mastery is a prerequisite for successful dis- course encounters in an L2. Cross-cultural comparisons, as claimed by Johnstone (2018), are very effec- tive for knowing and acquiring the speech acts. Accordingly, the purpose of this study was to compare a limited number of communicative routines in English and Persian within the f...

متن کامل

A Review on Multilingual Text to Speech Synthesis by Syllabifying the Words of Devanagari and Roman

Speech synthesis is process of spoken language as an input text and converted into speech waveforms. This paper describes the text to speech system for Devanagari scripted language and Roman Language. There are many earliest TTS systems are available but for Devanagari and Roman scripts are not available.

متن کامل

Feasibility of constructing an expressive speech corpus from television soap opera dialogue

This paper presents a study into the feasibility of extracting a corpus of expressive speech from television soap opera dialogue. We investigated how dialogue can be extracted from television production tapes, and what kind of signal quality may be expected. We analysed to what extent the scripts that are used in television production can provide a transcription of the actual dialogue. From the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره cs.CL/0006017  شماره 

صفحات  -

تاریخ انتشار 1999